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1.  Introduction 


Based  on  recent  work  in  image  modeling  [l]-[5]  it  is  reason- 
»ble  to  try  to  derive  real  image  properties  from  the  underlying 
models  assumed  to  qenerate  the  image.  Among  such  properties  are 
expected  perimeter,  average  edge  value,  and  expected  length  of  a 

run . 

The  following  statistical  models  generating  images  are  analyzed 
in  [11-  (4]:  Poisson  line,  occupancy,  and  Delaunay  models.  The 
assumption  is  that  one  of  these  processes  generates  a  good  approxi¬ 
mation  to  a  given  class  of  natural  images.  The  process  which  best 
predicts  the  image  properties  will  be  considered  to  be  the  process 
generating  the  image. 


2. 


E  xpec  t  ed  pen  meter 

Synthetic  images  are  easy  to  divide  into  objects  and  back¬ 
ground.  Thus  the  perimeter  of  the  objects  in  such  images  is  well 
defined.  For  digital  pictures  perimeter  is  measured  by  following 
object  borders  and  counting  vertical  or  horizontal  moves  as  1 
and  diagonal  moves  as  >2.  A  more  accurate  estimate  of  perimeter 
is  the  average  of  border  length  as  traced  in  the  background  (white) 
along  the  neighboring  black  points  and  in  the  object  (black)  along 
the  neighboring  white  points.  Real  images  must  be  thresholded 
first  before  measuring  perimeter  length.  The  choice  of  a  good 
threshold  will  not  be  discussed  here. 


2 . 1  Synthetic  images 

Experiments  with  synthetic  images  were  conducted  using  two 
iinds  of  mosaic  models:  occupancy  and  Delaunay  (the  dual  of 
occupancy).  Poisson  line  mosaics  were  not  considered  due  to  the 
errors  introduced  by  the  thickness  of  the  lines  in  the  digital 
process  of  random  coloring  of  the  Poisson  mosaic. 

Cells  in  the  mosaic  were  colored  randomly  black  with  prob¬ 
ability  p  and  white  with  probability  1-p.  A  regular  tessellation 
is  called  a  KV  tessellation  if  each  cell  has  K  neighbors  and  V 
cells  meet  at  each  vertex.  In  !l]  it  is  conjectured  and  experi¬ 
mentally  confirmed  that  a  random  tessellation  in  which  the  expected 
number  of  neighbors  of  each  cell  is  K  and  the  expected  number  of 
cells  meeting  at  a  vertex  is  V,  has  the  same  expected  number  of 
connected  components  as  a  KV  randomly  colored  regular  tessellation 
provided  that  they  have  the  same  number  of  cells.  Under  this 
assumption  the  total  expected  perimeter  (T.E.P)  of  components  in 
a  mosaic  is  calculated  as  follows: 


T.E.P  =  expected  perimeter  of  all  components 

,  .  , .  . .  expected  cell  perimeter 

in  the  regular  tessellation  x  ixpSctBa-^USBir  ZTsTtes 

of  a  cell 

Mote  that  eacn  grid  point  in  a  regular  tessellation  represents 
one  cell  in  the  corresponding  mosaic. 

Occupancy  model:  Here  the  corresponding  regular  tessellation  is 
a  hexagonal  grid.  Let  S  =  mosaic  area,  X  =  intensity  of  points 


dropped  on  the  plane  to  generate  the  mosaic  nuclei.  Using  the 
formula  above  and  the  expressions  derived  in  (1)  we  get: 

T.E.P  =  p  •  V  •  S  •  6  ( 1-p)  --  •  r  =  4p(l-p)S/X 

*  f\  0 

/X 

■*  is  the  expected  perimeter  in  the  occupancy  process  and  the 

r  \ 

average  number  of  sides  of  a  cell  is  6  (5). 

V  is  computed  as  follows: 

No.  of  cells  in  the  mosaic  X  predicted  No.  of  connected 
components  per  cell  '  observed  No.  of  connected 

components  in  the  mosaic 

Thus: 

V*S*  P.-N^-9.  *  o.N.C.C. 

where  S  =  area  of  mosaic  having  100  cells  (the  same  as  in  Table 
7  111  in  order  to  have  the  same  border  effect). 

P.N.C.C  predicted  No.  of  connected  components  for  the  appropriate 
p  in  the  hexagonal  grid  taken  from  Table  7  in  (1). 

S’  =  10x10  hex igona 1  grid  having  100  cells. 

O.N.C.C  =  observed  No.  of  connected  components  in  the  generated 
mosaic. 

Experimental  results  were  obtained  for  four  mosaics.  See 
Figures  1(a) -1(d)  for  the  tessellations  and  Figures  2 (a) -2(d)  for 
the  corresponding  colored  mosaics.  Final  judgment  must  be  based 
on  the  average  of  these  results. 

The  results  are  given  in  Table  1.  The  average  deviation 
between  observed  and  predicted  perimeter  is  very  small  (2.7%).  The 
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average  computed  A (.0033)  is  in  good  agreement  (10%  deviation) 
with  A=.003  which  war  used  for  generating  the  tessellations. 


Delaunay  model:  The  triangular  tessellation  has  K\  the  same  as 
in  the  occupancy  case  but  with  the  roles  of  K  and  V  m»-er  'hanged — 
namely,  the  number  of  neighbors  of  each  cell  is  K=3  and  the  number 
of  cells  meeting  at  each  vertex  is  V=6.  Here  the  T.E.P.  is  computed 
using  the  same  principle  as  in  the  occupancy  case: 


T.E.P. 


p*  2 • A  *S • 3 ( 1-p) • 


32/X 


1 

3 


2 (1-p) -pS 


32/1 


■  is  the  expected  perimeter  of  a  triangular  cell  and  there  are 
three  edges  per  cell  on  the  average  (5).  A  in  the  formula  above 
is  the  intensity  of  the  dual  occupancy  case  so  A  derived  now, 
with  data  from  Table  2  in  (2] ,  has  to  be  divided  by  2  to  be  used 
for  computing  T.E.P  (>Q  *  .  The  results  are  given  in  Table  2. 

See  Figures  3(a)-1(d)  for  the  tessellations  and  4(a)-4(d)  for  the 
corresponding  colored  mosaics. 

S  was  200x200  and  Ag«.0012  was  used  to  generate  the  tessel¬ 
lation.  The  computed  A  is  within  17%  of  the  observed  one;  this 
is  a  consequence  of  the  small  number  of  measurements  with  high 
variance  used  to  derive  Table  2  in  [2] . 


Figure  3:  Delaunay  tessellations  ( V  .0012) 


2.2  Natural  images 

We  now  deal  with  attempts  to  fit  mosaic  models  to  real 
two-color  images.  First  the  images  are  thresholded  so  as  to 
have  a  definite  border  between  object  and  background.  Then  p, 
the  probability  of  object  points,  is  estimated  by  the  fraction 
'f  .ib’ect  points  in  the  image.  The  number  of  connected  components 
iO.N.C.C)  is  counted  and  perimeter  is  measured.  Obviously  all 
f  the  last  three  parameters  art'  functions  of  the  threshold  and 
it  should  be  selected  so  that  these  para  ters  are  not  sensitive 
to  small  changes  in  the  threshold.  Measurements  were  conducted 
on  the  interior  200x200  portion  of  a  510x480  picture  of  marble 
Brodatz,  U62)  which  is  relatively  sharp  and  easy  to  threshold; 
see  Figures  5(a)— 5(d)  .  The  results  were:  p»0.408,  O.N.C.C.=27, 
observed  perimeter  1937.  Computations  were  done  to  test  three 
image  mosaic  models:  Poisson  line,  occupancy,  and  Delaunay.  The 

results  are  as  follows: 

T^ 

Poisson:  Here  •  _  [5] .  Thus  by  the  considerations  in  Section 

2.1  for  computing  we  derive: 


T  -  /Ot^C,C.  L-_ 


T*  2 "  1  2  ~ 

T .  E .  P  .  =  4  1  -p  )  *i  •  _  )  •  S  •  *■  •  j  2(l-p)  'p*S*T  where  „  is  t  he  expected 
perimeter  of  a  cell.  The  expected  number  of  sides  of  a  cell  is 


Figure  5:  (a)  Brodatz's  162  marble  picture 


(b)-(d):  After  thresholding  at  25,  17, 

and  3f>,  respectively 


Occupancy  ind  Delaunay :  The  corresponding  formulae  from  Section 
2 . 1  were  used . 

The  results  for  three  different  thresholds  for  the  Poisson 
line,  occupancy  and  Delaunay  models  are  presented  in  Tables  3-5 
respectively.  The  Poisson  line  model  is  clearly  much  better 
than  the  occupancy  and  Delaunay  models,  as  might  be  guessed  by 
1  king  at  the  picture.  The  Poisson  line  results  are  also  less 
•nsitive  to  variation  of  the  threshold. 


Table  3 


Poisson  line  model 


Thresho Id 

0 .  N  .  C  •  C . 

.  _____  ~  _ .  - . 

i 

p 

X 

j  Observed 
perimeter 

Computed 

perimeter 

Deviation 

25 

- — 

27 

.408 

.0062 

2013 

2695 

+33.8% 

36 

.494 

.0083 

2520 

324  3 

i  +29.7% 

36 

19 

.299  j 

.0044 

1453 

1975 

1  +35.9% 

Table  4:  Occupancy  model 


.  4  H 

I - 

1  iT* 

1 

r—4 

O 

l _ _ 

2013 

r - 

8250 

309% 

36 

.  494 

.0489 

2520 

8842 

250% 

19 

.  299 

.00549 

1453 

2485 

71% 

Table  5:  Delaunay  model 


25 

27 

.408 

r — - -  —  i 

.00412 

- - - - 

2013 

8055 

300% 

17 

36 

.494 

.0066 

2520 

5522 

119% 

36 

19 

.  299 

.  00285 

1453 

3045 

i  109% 

!.  Kxpected  edge  value 

In  this  section  we  discuss  the  possibility  of  applying  the 
analysis  of  Section  2  to  predicting  the  distribution  of  edge 
values  (e.g.,  gradient  magnitudes)  for  an  image .  it  is  concluded 
that  such  predictions  would  not  be  very  accurate. 

We  can  view  an  image  as  composed  of  three  basic  parts: 
ibiect,  background  and  border  points.  If  we  consider  the  border 
t  be  sharp  (this  assumption  will  be  discussed  later),  we  measure 

•he  perimeter  and  compute  the  fraction  of  border  points  per  unit  area 

per  met  er 
as  q  =  r  s 

Let  stand  for  the  edge  operator;  then  the  expected  value 

of  '  for  the  image  is  given  by 

E  (  * )  tBC  )  ( l-p)  ♦Eq(A)  (p-qH-qE^tM 

where  E„(.\)  expected  background  edqe  value 
B 

Eq(’)  t:xpected  object  edge  value 

I*  ( )  *  expected  border  edge  value. 

BO 

The  term  (p-q)  is  used  to  correct  for  considering  border  points 
to  be  part  of  the  object. 

If  we  use  a  ono-dimensionil  edge  detector  in  the  horizontal 

r  1  ith?  tor 

direction,  for  simplicity,  then  q  *  ”  since  on  the  average 

nly  half  of  the  border  segments  are  vertical. 

Consider  now  a  statistical  model  in  which  the  object  and  back¬ 
ground  regions,  in  a  picture  P,  are  generated  by  independent  random 
processes.  The  border  is  a  result  of  interaction  between  them. 


In  fact  we  will  compute  the  probability  distribution  of  edge 
values  in  each  region  of  P:  (liven  two  P  regions,  one  with  prob¬ 
ability  distribution  p^(f.)  and  the  other  with  p2<f2 ),  we  will 

compute  the  p.d.f.  of  f  0  -  f  ^  (edge  value  in  P)  ,  where  fj  and 

f,  are  grey  level  values  in  regions  1  and  2  respectively. 

Now  ptjf.-f-  -d)  -  p.  (  f  ' )  [p.  (  f  '  +d)  +p?  (f ' -d)  J  . 

1  z  owf'fc-  1  ^ 

Assuming  the  p.d.f. 's  are  continuous  and  setting  x=f',  we  get 
p(  f  ^ - f  ^  3ci)  /  Pj^  (x)  lp2  (x+d) +p2  (x-d)  )dx. 

If  pj  and  p?  are  and  respectively,  we 

•  let  : 


p(  ^  2 


KC 


,  -  id^-Bd-  »  -  id"  +  nd-  ( 

[e  +e 


where 


C, 


ury2 


1  ”  2( ■2+'2) 
1  2] 


<oJ+o2) 


-  ;*■;> 


✓rropop 


K 


_  1 _ / 

2  - N  (B)  -nT-IT  *’  2 


NIB)  ■  FN(0,1I IXiBI 

K  is  the  normalization  factor  of  p(  f^-f,  =d) 

Computing  the  expected  value  of  P( ! f j-f2 | *d)  over  the  entire 
;ray  level  range  f0,®|  wo  get: 

E  ( A )  -  2K  (.■ 2  ♦  1“w2 )  (N(B)-N(-B))  ] 

whore  *.  -  fi~^2 


1 1 


, ^  ,  we  have  B -O  so  that  N(B)  N(-B); 
while  if  ,  we  have  B-^O  so  that  N(B)<N(-B). 

Therefore  the  second  term  in  E(.‘.)  is  always  positive  and  thus 

so  is  E(.\ ). 

Assuming  that  the  gray  levels  at  points  at  distance  at  least 
2  apart  in  the  discrete  P  are  independently  distributed  according 
to  N  '^)  or  N  ( u  j »  '  -> )  »  K(’)  is  also  true  inside  these  regions 

provided  we  set  and  In  this  case  we  get  E1  (A)  =2  ^ 

for  i  =  l,2.  In  case  j='/-0,  the  formula  for  E(A)  is  not  valid 

a  no  E  ( A )  =  i  -  l )  . 

1  *■ 

Unfortunately  this  model  for  averatje  edge  value  estimation  l 
not  accurate  since  g,  which  depends  on  the  perimeter,  is  not 
accurate.  The  reason  is  that  perimeter  measurement  is  very  sen¬ 
sitive  to  the  threshold  applied  to  a  picture,  and  yet  there  is 
n  precise  way  to  determine  the  correct  threshold.  Moreover, 
counting  the  number  of  connected  components  in  the  case  of  natural 
images  is  not  accurate  since  we  don’t  know  beforehand  which  is 
the  underlying  tessellation  (as  we  did  in  the  case  of  synthetic 
images)  and  therefore  which  small  components  to  discard  due  to 
thresholding  errors.  We  also  don’t  know  which  separate  components 
close  to  the  picture  border  must  be  considered  as  one  because 
they  are  joined  beyond  the  image  border.  The  assumption  of  in¬ 
dependence  of  gray  levels  at  adjacent  points  of  the  picture,  or 
even  at  distance  2  apart,  is  not  always  correct;  this  was  checked 
in  the  case  of  the  marble  picture  (Brodatz  162) .  When  blur  is 


present ,  perimeter  is  not  as  well  defined  as  in  sharp  images, 
so  q  is  not  accurate. 


In  summary ,  computing  expected  edge  value,  assuming  gener- 
at  ion  of  the  image  by  a  random  mosaic  process,  cannot  be  expected 
to  yield  accurate  results. 


•J .  Expected  width 

The  expected  widths  (run  lengths)  of  connected  components 
m  synthetic  images  is  analyzed  in  [6).  Experiments  were  done 
t  test  the  validity  of  the  following  formulas  for  two-color 
m  ;  nos: 

TT 

Poisson  line:  E(f)  ■  •xzrn — t  where  T  ■ 

2T ( 1 -p) 

t  =  run  length  in  a  component 
p  and  i  are  defined  in  Section  2. 

n 

Occupancy :  E  <  < )  4/y  ( l-p) 

3-2 

Delaunay:  E(f)  =  - 

64A(l-p) 

Note  that  these  formulas  are  singular  for  p~ 1  while  the  experi- 
mental  width  approaches  the  image  frame  width  for  p  values  near 
1.  Therefore  the  validity  of  the  above  formulas  should  be 
tested  for  low  p  values.  In  fact  p-0.3  was  used  in  the  following 
ex:  e  r  l  me  n  t  s . 

The  results  for  four  occupancy  and  Delaunay  models  are  given 
in  Tables  6  and  7.  For  the  marble  picture  (flrodatz's  *62)  the 
results  in  Table  B  show  insensitivity  to  threshold  variations: 

2.2»  between  the  various  models.  As  in  Section  2,  we  use  for 
the  calculations  an  internal  frame  of  a  tessellation  which  contains 
cells  wholly  included  in  the  original  frame.  Components  touchinq 
the  border  in  the  internal  frame  are  considered  for  run  length 
computation,  since  it  is  believed  that,  on  the  average,  the 


Table  6: 


Synthet ic  Occupancy 


Mosaic 

number 

X 

P 

Observed 

width 

Computed 

width 

Deviat ion 

.003 

f  °-3  ] 

21 .402 

19.47 

2 

.003 

0 . 3 

20.181 

19.47 

3 

.003 

0. 3 

19 . 066 

| 

19.47 

•1 

_ L 

.003 

0.  3 

- 

19.359 

U  -  -i  J 

19.47 

Average 

.003 

0.3 

20.002 

19.47 

2.65% 

Table  7:  Synthetic  Delaunay 


Mosaic 

number 

1  X  1 

0 

[ _ _ 

P  j 

Obse • ved 
width 

Com} uted 
w  i  d  t  h 

Dev i at  ion 

1 

.  002 

r  o.  3 

18.01 

14.76 

“  1 

d. 

.002 

0.3 

15.00 

14.76 

3 

.002 

0.3 

15.27 

14.76 

4 

.002  i 

0.3 

16.97 

14 .76 

Averaqe 

.002 

0.3  ! 

16.  ’'1 

14.76 

10.5% 

correct  width  of  »  border-touching  component  is  encountered 
inside  the  image. 

As  in  the  case  of  perimeter,  the  Poisson  line  process  best 
predicts  the  observed  run  length  of  a  component  in  the  marble 
:  icture  »f2).  In  fact  it  is  four  times  better  than  the  predictions 
of  the  other  two  mosaic  models.  Moreover,  in  the  width  case  the 
deviation  between  the  observed  and  predicted  value  is  at  most 
1  *  (Poisson  model),  which  is  much  less  than  the  deviation  in 
the  perimeter  case  for  the  Poisson  model  (34'*)  . 


5.  Conclusions 


a)  Results  for  expected  perimeter  in  two-color  synthetic 
tessellations  are  in  very  good  agreement  with  observations. 

b)  Results  for  real  images  are  not  very  good.  Some  models 
?it  an  order  of  magnitude  better  than  others,  but  reliable  con¬ 
clusions  can  be  drawn  only  after  trying  more  real  images.  Using 
perimeter  evaluation  for  estimating  the  edge  value  density  of 
real  images  will  not  be  accurate,  due  to  its  dependence  on  the 
thresholding  process  which  is  not  accurately  defined. 

c)  For  real  images,  where  thresholds  are  used  to  distinguish 
between  ob}ects  and  background,  the  width  is  much  less  sensitive 
than  the  perimeter  to  variations  in  the  threshold,  and  therefore 
is  a  much  better  measure  for  determining  the  generating  tessel¬ 
lation. 
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